Rega-Net: Retina Gabor Attention for Deep Convolutional Neural Networks
نویسندگان
چکیده
Extensive research works demonstrate that the attention mechanism in convolutional neural networks (CNNs) effectively improves accuracy. Nevertheless, few design mechanisms using large receptive fields. In this work, we propose a novel method named Rega-Net to increase CNN accuracy by enlarging field. To best of our knowledge, increasing field network requires size convolution kernel, which also increases number parameters. For solving problem, kernels resemble non-uniformly distributed structure inspired human retina. Then, sample variable-resolution values Gabor function distribution and fill these retina-like kernels. This allows essential features be more visible center position We further an module including Experiments achieves 79.96% Top-1 on ImageNet-1K for classification 43.1% mAP COCO2017 object detection. The increased up 3.5% compared baseline networks.
منابع مشابه
Cystoscopy Image Classication Using Deep Convolutional Neural Networks
In the past three decades, the use of smart methods in medical diagnostic systems has attractedthe attention of many researchers. However, no smart activity has been provided in the eld ofmedical image processing for diagnosis of bladder cancer through cystoscopy images despite the highprevalence in the world. In this paper, two well-known convolutional neural networks (CNNs) ...
متن کاملDeep Epitomic Convolutional Neural Networks
Deep convolutional neural networks have recently proven extremely competitive in challenging image recognition tasks. This paper proposes the epitomic convolution as a new building block for deep neural networks. An epitomic convolution layer replaces a pair of consecutive convolution and max-pooling layers found in standard deep convolutional neural networks. The main version of the proposed m...
متن کاملDeep Convolutional Neural Networks for pedestrian detection
Pedestrian detection is a popular research topic due to its paramount importance for a number of applications, especially in the fields of automotive, surveillance and robotics. Despite the significant improvements, pedestrian detection is still an open challenge that calls for more and more accurate algorithms. In the last few years, deep learning and in particular convolutional neural network...
متن کاملDeep Convolutional Neural Networks for Smile Recognition
This thesis describes the design and implementation of a smile detector based on deep convolutional neural networks. It starts with a summary of neural networks, the difficulties of training them and new training methods, such as Restricted Boltzmann Machines or autoencoders. It then provides a literature review of convolutional neural networks and recurrent neural networks. In order to select ...
متن کاملVery deep convolutional neural networks for LVCSR
Convolutional Neural Networks (CNNs) have achieved state-of-the-art performance when embedded in large vocabulary continuous speech recognition (LVCSR) systems due to its capability of modeling local correlations and reducing translational variations. In all previous related works for ASR, only up to two convolutional layers are employed. In light of the recent success of very deep CNNs in imag...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Geoscience and Remote Sensing Letters
سال: 2023
ISSN: ['1558-0571', '1545-598X']
DOI: https://doi.org/10.1109/lgrs.2023.3270186